A New Privacy-Preserving Data Publishing Algorithm Utilizing Connectivity-Based Outlier Factor and Mondrian Techniques

نویسندگان

چکیده

Developing a privacy-preserving data publishing algorithm that stops individuals from disclosing their identities while not ignoring utility remains an important goal to achieve. Because finding the trade-off between privacy and is NP-hard problem also current research area. When existing approaches are investigated, one of most significant difficulties discovered presence outlier in datasets. Outlier has negative impact on utility. Furthermore, k-anonymity algorithms, which commonly used literature, do provide adequate protection against data. In this study, new anonymization devised tested for boosting by incorporating detection mechanism into Mondrian algorithm. The connectivity-based factor (COF) detect outliers. selected because its capacity anonymize multidimensional meeting needs real-world COF, other hand, discover outliers high-dimensional datasets with complicated structures. proposed generates more equivalence classes than provides greater previous algorithms based k-anonymization. addition, it outperforms discernibility metric (DM), normalized average class size (Cavg), global certainty penalty (GCP), query error rate, classification accuracy (CA), F-measure metrics. Moreover, increase values GCP rate metrics demonstrates facilitates obtaining higher grouping closer points when compared algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Privacy - Preserving Data Publishing

The success of data mining relies on the availability of high quality data. To ensure quality data mining, effective information sharing between organizations becomes a vital requirement in today's society. Since data mining often involves person-specific and sensitive information like medical records, the public has expressed a deep concern about their privacy. Privacy-preserving data publishi...

متن کامل

Privacy Preserving Techniques in Social Networks Data Publishing - A Review

Development of online social networks and publication of social network data has led to the risk of leakage of confidential information of individuals. This requires the preservation of privacy before such network data is published by service providers. Privacy in online social networks data has been of utmost concern in recent years. Hence, the research in this field is still in its early year...

متن کامل

A Survey on Big Data & Privacy Preserving Publishing Techniques

Big data describes very large data sets that have more diverse and complicated structure like weblogs, social media, email, sensors, and photographs. These less structured data and distinctiveness characteristics from traditional databases typically associated with extra complications in storing, analyzing and applying further procedures or extracting results. Big data analytics is the process ...

متن کامل

Towards Privacy Preserving Data Publishing∗

High quality and useful knowledge is to be found in the integrated data from various organizations, and the discovered knowledge is essential for building intelligent systems such as business analysis and health surveillance. However, concern about breaching privacy is a major obstacle of this process. This project aims to develop new efficient and effective techniques for privacy protection in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers, materials & continua

سال: 2023

ISSN: ['1546-2218', '1546-2226']

DOI: https://doi.org/10.32604/cmc.2023.040274